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Abstract 

Coordinated information processing by the base stations of multi-cell wireless networks en- 
hances the overall quality of communication in the network. Such coordinations for optimizing 
any desired network-wide quality of service (QoS) necessitate the base stations to acquire and 
share some channel state information (CSI). With perfect knowledge of channel states, the base 
stations can adjust their transmissions for achieving a network-wise QoS optimality. In prac- 
tice, however, the CSI can be obtained only imperfectly. As a result, due to the uncertainties 
involved, the network is not guaranteed to benefit from a globally optimal QoS. Nevertheless, if 
the channel estimation perturbations arc confined within bounded regions, the QoS measure will 
also lie within a bounded region. Therefore, by exploiting the notion of robustness in the worst- 
case sense some worst-case QoS guarantees for the network can be asserted. We adopt a popular 
model for noisy channel estimates that assumes that estimation noise terms lie within known 
hyper-spheres. We aim to design linear transceivers that optimize a worst-case QoS measure in 
downlink transmissions. In particular, we focus on maximizing the worst-case weighted sum-rate 
of the network and the minimum worst-case rate of the network. For obtaining such transceiver 
designs, we offer several centralized (fully cooperative) and distributed (limited cooperation) 
algorithms which entail different levels of complexity and information exchange among the base 
stations. 
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1 Introduction 



The increasing demand for accommodating more users within wireless networks makes such net- 
works interference limited. Multi-cell multiuser networks suffer from intra-cell interference as a 
consequence of having the base stations serve multiple users simultaneously, as well as from inter- 
cell interference among neighboring cells due to their ever shrinking sizes. A useful approach for 
mitigating the interference in downlink transmissions is to equip the base stations with multiple 
transmit antennas and employ transmit precoding. Such precoding exploits the spatial dimension 
to ensure that the signals intended for different users remain easily separable at their designated 
receivers. For enabling precoded transmission, the base stations should acquire the knowledge of 
channel states or channel state information (CSI). 

The problem of designing linear precoders for single-cell downlink transmissions has been ex- 
tensively investigated in the literature. Assuming perfect knowledge of channel states at the base 
station, different transmission optimization schemes (e.g., power optimization, power-per-antenna 
optimization and max-min rate optimization) were investigated for designing linear precoders [IHS] ■ 
On the other hand, recent works on precoder design for multi-cell downlink transmissions assumed 
that both data and channel state information of all users can be perfectly shared among base sta- 
tions in real-time [THS]. In [3[8], coordinated base stations are simply regarded as a single large 
array with distributed antenna elements so that previously known single-cell precoding techniques 
can be applied to this scenario in a fairly straightforward manner. Instead, a more realistic model 
is considered in |9j which accounts for the fundamentally asynchronous nature of the interference 
due to the different propagation delays from the many base stations to each mobile. Practical 
concerns on the complexity of the network infrastructure and synchronization requirements may 
permit coordination only on a per-cluster basis as suggested in [lOpilj . Also, the limited bandwidth 
of the backbone network connecting the base stations may prevent real-time data sharing; in this 
case, each user can be served by only one base station, but the set of downlink precoders can still 
be optimized based on the inter-cell channel qualities [12^113] . 

In practice, however, a base station can acquire only imperfect CSI which is contaminated 
with unknown errors. Motivated by this practical premise, the paradigm of robust optimization 
has been recently employed to address the problem of precoder designs for single-cell downlink 
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transmissions |14H17| . Imperfect CSI might be due erroneous channel estimation or quantization 
errors. For the former one, the uncertainty region of the CSI errors is modeled probabilistically, 
where it is assumed to be unbounded and distributed according to some known distribution. For 
the latter one, the uncertainty region of the CSI perturbations is assumed to be bounded. |14j 
and [15] consider the probabilistic model and aim at optimizing a utility function by averaging 
over the entire uncertainty region. On the other hand, when the CSI noise terms are bounded, a 
promising approach is to design the precoders that yield worst-case guarantees, i.e., ensure worst- 
case robustness. Based on this notion, [16] examines the problems of mean-squared error (MSE) 
and signal-to-interference-plus-noise ratio (SINR) optimization for the multiple-input single-output 
(MISO) single-cell multiuser downlink transmission and [T7] considers the problem of MSE opti- 
mization for multiple-input multiple-output (MIMO) single-cell multiuser downlink systems. 

In this paper we consider the more general model of multi-cell wireless networks, which hitherto 
has not been investigated for robust optimization, and treat the problem of joint robust transmission 
optimization for all cells. The significance of such multi-cell transmission optimization is that it 
incorporates the effects of inter-cell interference which is ignored when the cells optimize their 
transmissions independently. Furthermore, we incorporate a practical constraint which forbids real- 
time data sharing among base stations so that each user can be served by only one base station. 
We adopt the bounded CSI noise model and design transceivers that optimize a worst-case quality 
of service measure. In particular, we focus on maximizing the worst-case weighted sum-rate of the 
network and the minimum worst-case rate of the network. For obtaining such transceiver designs, 
we offer centralized and distributed algorithms with different levels of complexity and information 
exchange among the base stations. We also show that these problems can be translated into or 
approximated by convex problems that can be solved efficiently as semidefinite programs (SDP) 
with tractable computational complexity. 

The remainder of the paper is organized as follows. In Section [2] we provide the system model. 
The formulations of the minimum worst-case rate and worst-case weighted sum-rate problems are 
discussed in Section [3] and are treated in Sections H] and [5l respectively. We also offer distributed 
algorithms with limited cooperation and information exchange among the base stations in order 
to solve these problems. For the weighted sum-rate problem, we also provide some comments on 
the rate the the sum-rate scales with increasing SNR. Simulation results are given in Section [7] and 
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Section [8] concludes the paper. 



2 Transmission Model 

We consider a multi-cell network with M cells each with one base station (BS) that serves K 
users. The BSs are equipped with N transmit antennas and each user has one receive antenna. We 
denote B m as the BS of the m th cell and Uj^ as the k th user in the m th cell for m G {1, . . . , M} 
and k G {1, . . . , K} . We assume quasi-static flat-fading channels and denote the downlink channel 
from B n to U*, by h^ n eC M . 

Let x m = [x^, . . . ,x^] T G C Kxl denote the information stream of B m , intended for serving 
its designated users via spatial multiplexing and assume that E[a; m a;^] = i". Prior to transmission 
by B m , the information stream x m is linearly processed (precoded) by the precoding matrix <& m G 
C NxK . While non- linear precoding approaches can offer near optimal performance, they are not 
viable in practice. Alternatively, linear precoding approaches can achieve reasonable throughput 
performance (i.e., a small sub-optimality gap) with considerably lower complexity relative to non- 
linear precoding approaches and hence is the route adopted by emerging wireless standards such 
as 3GPP LTE and IEEE 802.16m. 

We denote the k th column of 3> m by G C Nxl which is the beam carrying the information 
stream intended for user U^. By defining G C \ {0} as the single-tap receiver equalizer deployed 
by U^j, the received post-equalization signal at is given by 

1 M 

= ~7k~ ( ^m.n^n^n + Z m J , (1) 
Im n=1 

where ~ CN(0, 1) accounts for the additive white complex Gaussian noise. We assume that the 
users deploy single-user decoders for recovering their designated messages while suppressing the 
messages intended for other users as Gaussian interference. Therefore, the SINR of user (with 
the optimal equalizer) is given by 

S\NR k m = - h \h k m , m w k m \ 2 (2) 

Z^iljik \' l m,m u 'm\ T Z-m/m i—il \"'m,n UJ n\ ~ x 

Also we define MSE^ as the mean square-error (MSE) of user when it deploys the equalizer 



4 



/ m , and it is given by 

MSE fc = E\\v k -x k l 2 l = 1 I \h k w k - f k \ 2 + w l \ 2 + s T s T\h k w l I 2 + 1 I 

IJml \ l^k n^m I J 

(3) 

We further define MSE^ as the MSE corresponding to the minimum mean-square error (MMSE) 
equalizer which minimizes the MSE over all possible equalizers, i.e., 

MSE^minMSEt (4) 

f m 

We assume that user perfectly knows its incoming channels, which includes all the channels 
{/i^j n } for all choices of n, k. In contrast, each BS can acquire only noisy estimates of such chan- 
nels corresponding to its designated receivers, i.e., B m knows the channels {/i^ n }fc,n imperfectly. 
We denote the noisy estimate of the channel h 1 ^ n available at B m (and possibly other BSs via 

~ k 

cooperation) by h m n and define the channel estimation errors, which are unknown to the BSs, as 
A m , n = h m ,n - km,n , V m, n G {1, . . . , M} , and V k G {1, . . . , K}. (5) 

We assume that such channel estimation errors are bounded and confined within an origin-centered 
hyper-spherical region of radius e k nn , i.e., ||A^ n ||2 < e^n- There are two popular approaches for 
modeling the channel estimation errors: bounded and deterministic, and unbounded and random 
with known distribution. The reasons for selecting the bounded model can be summarized as 
follows. In the process of acquiring the channel state information (CSI) by the base-stations, 
there are two sources that induce uncertainty about the CSI. In this process the channels are first 
estimated by the mobiles and then are quantized and fed back to the base-stations. Therefore, there 
are two sources of CSI errors: estimation error and the quantization error. When the estimation 
is accurate enough but the amount of feedback bits available for feeding back the CSI (which 
determine the size of the quantization codebook) is limited, the quantization error will be dominant 
error term. On the other hand, when there is no limit on the feedback rate and the channel 
estimates are not very accurate, then the error terms are dominated by the estimation errors In the 
emerging standards for the next-generation cellular wireless networks, the amount of bits reserved 
for quantized CSI feedback is small. Small number of feedback bits implies a coarse quantization 
codebook and hence deteriorates the quantization accuracy. Therefore, with a good estimator, 
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there exists a high likelihood that the quantization error is dominant source of uncertainty about 
the CSI. Taking into account that the quantization errors are bounded, the model adopted here 
for the uncertainties about the CSI is justified. It is also noteworthy that an issue with using the 
probabilistic model for the CSI perturbation in practical systems is identifying the right distribution 
since the Gaussian assumption on the estimation error need not be well justified. Finally, once a 
suitable distribution is identified for the probabilistic model, we can first find the probability that 
the CSI perturbations fall outside a bounded region (outage probability). Employing the outage 
probability in conjunction with the the analysis provided in our paper for the bounded CSI error 
model can also provide a direction for analyzing the systems with a probabilistic model for the CSI 
perturbation. We also note that all the results derived in the sequel can be readily extended to the 
case where the uncertainty regions are bounded hyper-ellipsoids. 

In the sequel, for any matrix A we use ||A||2 to denote its Frobenius norm. 

3 Problem Statement 

We consider multi-cell downlink transmission with imperfect channel state information at the trans- 
mitter (imperfect CSIT). The existing literature concentrates mostly on the single-cell downlink 
system with imperfect CSIT. The significance of investigating multi-cell networks pertains to the 
fact that it allows us to design and optimize the network as an integrated entity. Optimizing 
each cell individually ignores the impact of the cells on each other's performance, which in turn 
prevents from achieving a network-wise optimality. Also, analyzing multi-cell systems is not a 
straightforward generalization of the approaches known for the single-cell systems since there are 
two new challenges; there exists an additional source of interference, i.e., inter-cell interference. 
Moreover, some level of coordination/cooperation among the base-stations of different cells must 
be introduced. 

We strive to optimize two network-wide performance measures through designing the precoding 
matrices {3? m } and the receiver equalizers {/^}. Such optimization heavily hinges on the accuracy 
of channel estimates available at the BSs. Due to the uncertainties about channels estimates, 
we adopt the notion of robust optimization in the worst-case sense [18tll9|. The solution of the 
worst-case robust optimization is feasible over the entire uncertainty region and provides the best 
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guaranteed performance over all possible CSI errors. 

Based on this notion of robustness, we treat the following rate optimization problems. One 
pertains to maximizing the worst-case weighted sum-rate of the multi-cell network and the other 
one seeks to maximize the minimum worst-case rate in the network. Both optimizations are subject 
to individual power constraints for the BSs. Let us define R m as the rate assigned to user and 
denote the power budget for the BS B m by P m . We also define P = [Pi, ... , Pm] as the vector of 
power budgets. 

First we consider the robust max-min rate problem which aims to maximize the minimum worst- 
case rate of the network subject to the power budget P. Since the users are deploying single-user 
decoders, we have R^ = log(l + SINR^J. Therefore, this problem can be posed as 

max($ m } min fcjm min^^ j SINR^ 



S(P) = { x ~ mJ * "* . (6) 

^m||2 — Pfn 



S-t. ll*mllo < -Pm Vm 



As the second problem, we consider optimizing the worst-case weighted sum-rate of the network. 
For a given set of positive weighting factors {a m }, where a m is the weighting factor corresponding 
to the rate of user U m , this problem is formalized as 



K(P) = 



EM v^^v k nk 

m=l 22k=l a m Pm ^ 



S.t. Il^mlli — Pm ^ m 



The weighted sum-rate utility function is most appropriate for the situations when we are 
interested in maximizing the network throughput while ensuring long term fairness. This utility 
does not include any hard minimum rate constraints for the individual users. In an extreme case, 
for the benefit of the aggregate network throughput, some users might not be assigned any resource 
over some scheduling frames. Thus, such a utility function is useful when the users are delay tolerant 
(as in the case of best effort traffic) and would allow to be turned off or take small resources over 
some frames for the benefit of the network performance. By incorporating the weighting factors we 
can induce different priorities for the users. Moreover, these weights can be adapted over time to 
ensure long-term fairness. On the other hand, the notion of max-min rate optimization, which in 
each scheduling frame maximizes the rate of the weakest user in the network, guarantees a stricter 
(short-term) fairness among the users but at the cost of degraded network sum rate. Note that an 
optimal solution for the max-min rate utility will assign identical rates to all users. 



In addition to solving the aforementioned optimization problems, we also analyze the degrees 
of freedom available in the multi-cell networks with imperfect CSIT. The degrees of freedom metric 
has emerged as a popular tool for analyzing the sum-rates of the wireless networks in the asymptote 
of high SNRs and provides good insight into the sum-capacity of these systems for which the precise 
characterization of the capacity region is unknown. However, existing works assume perfect CSI 
in determining the available degrees of freedom. Instead, in this work we analyze the achievable 
degrees of freedom when only imperfect CSI is available. 

We remark that the joint design of the optimal precoders may require that each BS acquires 
global CSI, which necessitates full cooperation (including full CSI exchange) among the BSs. In 
Sections U] and [5] we provide centralized algorithms assuming that such full cooperation is feasible. 
In practice, however, full cooperation might not be implementable. In such cases we have to resort 
to distributed algorithms, that entail limited cooperation among the BSs. Towards this end, we 
also offer distributed algorithms in Sections d] and [5] that involve limited information exchange 
and coordination among the base-stations, which in one instance comes at the cost of degraded 
performance compared to the corresponding centralized counterpart. 

4 Robust Max-Min Rate Optimization 

4.1 Single-user Cells (K = 1) 

In this subsection we assume that each BS is serving one user, i.e., K = 1. Under this assumption, 
the downlink transmission model essentially becomes equivalent to a multiuser Gaussian interfer- 
ence channel with M transmitters and M respective receivers. For the ease of notation we omit 
the superscript k in the subsequent analysis and discussions. When K = 1 the precoder of BS 
B m consists of only one column vector which we refer to by w m . For the given channel estimates 
{h m ,n} we define sinr m = mm^ mn y SINR m as the worst-case (smallest) SINR m over the uncertainty 
regions. 

A remark is warranted on the distinction between multi-cell single-user networks and single-cell 
multiuser networks. The fundamental difference is due to the different types of interference in 
these two systems. In single-cell multiuser systems there exists only intra-cell interference and is 
handled by a single transmitter (base-station). Base-station, knowing the downlink channels of all 



S 



users, can jointly design the beamformers for all users. In the multi-cell single-user system, which 
can be also considered as a multi-user Gaussian interference channel, the users are facing only 
inter-cell interference. In these systems, as opposed to the single-cell systems, there are multiple 
transmitters (base-stations). Jointly designing the beamformers for all the users, therefore, requires 
some coordination and information exchange among the base-stations. 

By introducing a slack variable a > 0, the epigraph form of the robust max-min rate optimization 
problem S(P) given in ([6]) is given by 



S(P) 



max {w m },a a 

s.t. 



sinr m > a Vm, 



(8) 



w 



a || 2 — Pm VjTi. 



We proceed by finding the closed- form characterization of sinr m . By recalling ([2]) we have 



\h 



sin iv 



mm 



mm All 



\h 



(9) 



{ A m ,„} Y.n+m I h m,nW n | 2 + 1 YLn+m max A m , n | h m , n W n | 2 + 1 ' 

where the second equality holds by noting that the channels h m j and h m j for i ^ j have indepen- 
dent uncertainties and finding the worst-case uncertainties can be decoupled. Therefore, finding 
the worst-case SINR m can be decoupled into finding the worst-case (smallest) numerator term and 
the worst-case (largest) denominator terms. In order to further simplify sinr m we use the result of 
the following lemma. The proof is straightforward and is a simple extension of a result in [20] which 
considers robust beamforming for a point-to-point link with colored interference at the receiver. 



Lemma 1 For any given h G C lxN , w € C Nxl , e G M + , and positive definite matrix Q; g m - m and 
5max defined as 



are given by 



minjc \hw + xw\ 2 



i.t. yJxQx H < e 



, and g n 



max^ I hw + xw 1 2 



hw\ — e\/w H Q ^^w 



i.t. y/xQx H < e 



and g n 



hw\ + e\/w H Q 1 w 



where (x) + = max{0, x}, V x G 
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By recalling and invoking the result of the lemma above for the choice of Q = /, sinr m can be 
further simplified as 

( | h mm W m | £m,m H^m || 2)^ 

smr m = — . (10) 

\\ w nh\ +1 

We are interested in the scenarios where V m, \\h m m \\ > € mm , so that S(P) > 0. Given the closed- 
form characterization of sinr m , solving S(P) can be facilitated by solving a power optimization 
problem defined as 



min {Wm}i6 b 

s.t. sinr m > a Vm, (11) 

< b Vm . 



In this problem the maximum weighted power consumed by the base-stations is minimized subject 



to a quality of service guarantee for all users. The constraint sinr m > a assures that all users receive 
a worst-case SINR which is not smaller than a and the constraint UHaP < guarantees that the 
maximum weighted power consumption is minimized. This can be considered as the dual of the 
problem which strives to maximize the quality of service for the users for a given power budget. 
The connection between S(P) and V{P,a) is established in the following useful theorem. 

Theorem 1 For any given power budget P , V(P, a) is strictly increasing and continuous in a at 
any strictly feasible a and is related to S(P) via 

V(P,S(P)) = 1. 

Proof: See Appendix [A] ■ 
Strict monotonicity and continuity of V(P, a) in a at any strictly feasible a provides that there 
exists a unique a* satisfying V(P,a*) = 1. Hence, taking into account Theorem [T] establishes 
that solving S(P) boils down to finding a* that satisfies V(P,a*) = 1. Due to monotonicity and 
continuity of V(P, a), finding a* can be implemented via a simple iterative bi-section search. Each 
iteration requires solving V{P,a) for a different value of a. We demonstrate that V(P,a) can be 
cast as a convex problem with a computationally efficient solution. 

Theorem 2 Problem V(P,a) can be posed as an SDP. 

Proof: See Appendix [Bl ■ 
Such a procedure for solving S{P) necessitates the BSs to be fully cooperative such that each BS 
can acquire estimates of all network-wide channel states. 
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4.2 Multi-user Cells (K > 1) 

In this subsection we consider downlink transmissions serving more than one user in each cell 
(K > 1). The major difference between the analysis for multiuser cells and that of single-user 
cells arises from the different characterizations of their corresponding worst-case SINRs. By defining 
sinr^ as the worst-case SINR^ in ©, we have 

sinr^ = mm ^ — ■ ^ — . (12) 

{ A m,nl ^2il-Lk V X m,m W m\ + En^m Xw I ^"m,n w n\ + 1 

Unlike the single- user setup, when K > 1 the numerator and the summands of the denominator of 
sinr^ have a common uncertainty term. Therefore, finding sinr^ cannot be decoupled into finding 
the worst-case numerator and the worst-case terms in the denominator independently. To the 
best of our knowledge, handling constraints on such worst-case SINRs even in single-cell downlink 
transmissions is not mathematically tractable |16j and the robust design of linear precoders for these 
systems is carried out suboptimally |16|I17|. In the sequel we also propose suboptimal approaches 
for solving the robust max-min rate optimization in multi-cell networks. 

We offer two suboptimal approaches for solving S(P). In the first approach we find a lower 
bound on the worst-case SINR and in the formulation of S(P) replace each worst-case SINR with its 
corresponding lower bound. Similar to the single-user cells setup in Section 14.11 this approximate 
problem can be solved efficiently through solving a counterpart power optimization problem. In the 
second approach, we convert the robust max-min rate optimization problem into a robust min-max 
MSE optimization problem and find an upper bound on the maximum worst-case MSE which in 
turn provides a lower bound on the minimum worst-case rate. 

4.2.1 Solving via Power Optimization 

The worst-case value of SINR^, which we denote by sinrj^, is not mathematically tractable. Conse- 
quently, we find lower bounds on the worst-case SINRs as follows. We define 

min A fc \h 1 L m w 1 L\ 2 

sinr — 



max A^ m EljLk \ h i,m wl J 2 + En^m maX A^. n El \Ki,n w n? + 1 ' 
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for which we clearly have sinr^ < sinr^. By applying Lemma [T] we find that 

\(\h^ nfi I — Wiifi llo^P 



smr r 



max Afc ^, m *m,fe(*m,fc)^(^m,m) H + En^m maX A fc frm,n*n(*n) H (/»m,n) H + 1 ' 

(13) 

where we have defined ^ m ,k = [ w m ■ ■ ■ > ^m -1 ' ^m^ 1 ) ■ • • > ^m]- 

By introducing the slack variable a and invoking the lower bounds on the SINRs given in (113p . 
a lower bound on the robust max-min rate is obtained as follows. 



Si(P) = 



s.t. sinr,^ > a Mm,k, (14) 



|*m||! < P m Vm. 



Similar to the single-user scenario, solving S\(P) can be carried out by alternatively solving a 
power optimization problem in conjunction with a linear bi-section search. The power optimization 
of interest with per BS power constraints is given by 

min{# m },6 b 

V 1 {P,a) = { s .t. imr*>a Vm,k, (15) 

< b Vm. 

V *Tn 

The result in Theorem [T] can be extended for multiuser cell setup (K > 1) in order to establish the 
connection between S±(P) and V\(P, a). The proof is omitted for brevity. 

Theorem 3 For any given power budget P, Vi(P,a) is strictly increasing and continuous in a at 
any strictly feasible a and is related to Si(P) via 

Vi(P,S 1 (P)) = l. (16) 

We construct Algorithm 1 which solves S\(P) by solving V\(P,a) combined with a bi-section 
line search. The optimality of Algorithm 1 and its convergence follows from the monotonicity and 
continuity of V\{P,a) at any feasible a. Similar to Theorem [2] we demonstrate that Vi(P,a) has 
a computationally efficient solution. 

Theorem 4 Problem V\{P,a) can be posed as an SDP. 

Proof: See Appendix [Cj ■ 
Note that when K = 1, Si(P) is identical to S(P) and an optimal solution to the latter problem 
is obtained using Algorithm 1. 
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Algorithm 1 - Robust Max-Min SINR Optimization via Power Optimization (K > 1) 



1: 


Input P and {h mi „, e^„} 


2: 


~ h 

Initialize a min = and a max = mm m ^{F m |(||^ m „J| 2 


3: 


a a min 


4: 


repeat 


5: 


Solve /^(.Pjao) and obtain {<P T n| 


6: 


if Pi(P,a ) < 1 


7: 




8: 


else 


9: 


"max «0 


10: 


end if 


11: 


a <— (flmin + Omax)/2 


12: 


until a max - a min < 5 


13: 


Output 5i(P) = a and {<&,„} 



4.2.2 Solving via MSE Optimization 

First we transform the robust max-min rate optimization problem into a robust min-max MSE 
optimization problem by using the fact that 

1 



MSE* 



1 + SINF4' 

where we recall that MSEj^ is the MSE of user when it deploys the MMSE equalizer. Conse- 
quently, the worst-case MSE corresponding to the worst-case SINR is given as 



1 1 



max MSE„ = j- = ;-. (17) 

{**,,„} 1 + mul {A*, n } SINR S» 1 + sinr* 



■ in 



By recalling the problem S(P) given in ([6]) and taking into account the representation of MSE^ 
given in (|4|) and the worst-case MSE given in (|17p . the robust max-min rate optimization problem 
can be solved by equivalently solving 



«5 2 (P) 



min {#m} max^max^^min^ MSE^ 
S.t. Il^mlli < Pm Vm. 
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We proceed by finding an upper bound on 52 (P) which in turn results in a lower bound on S(P). 
By invoking the inequality 

max min MSE^ < min max MSE^, (19) 

r A fe i fh fk (At l 

and introducing the slack variable a E M + , we find the following upper bound on 52 (-P). 



MP) 



s.t. max /Afc ,MSE^<a 2 VJfe,m, (20) 

||*m|ll < Pm V?Tl. 



This problem for single-cell MIMO downlink transmissions is studied in [T7] where it is solved 
suboptimally via an iterative algorithm based on the alternating optimization (AO) principle. Here 
we show that the problem in (|2L)p is equivalent to a generalized eigenvalue problem (GEVP) (c.f. [3]) 
which can be solved efficiently 

Theorem 5 The problem S2 (P) can be optimized efficiently as a GEVP. 

Proof: See Appendix [Dj ■ 
4.3 Limited Cooperation 

In this subsection we propose distributed algorithms for the networks not supporting full CSI ex- 
change between the BSs. The first distributed algorithm we propose (Algorithm 2) requires limited 
information exchange between the BSs and each BS designs its precoders independently of others. 
The cost incurred for enabling such limited information exchange and distributed processing is the 
degraded performance compared with the centralized algorithm. 

The underlying notion of this distributed algorithm is to successively update the precoder of one 
BS at-a-time while keeping rest unchanged. More specifically, at the m th iteration, all precoders 
{<& n } n ^ m are fixed and only BS B m updates its precoder by maximizing the worst-case smallest 



x Note that for any function f(x, y) we have min x max s f(x, y) > max s min^ f(x, y). To see this, define (xo, yo) = 
argmina; niax M f(x, y) and = arg max, min^ f(x, y). Therefore, f(x ,y ) = m&x y f(x , y) > f(x ,yi) > 

min x f{x,yi) = f{x 1 ,y 1 ). 
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rate of the m th cell. By recalling (|14p . the optimization problem solved by B m is given by 



max$ m>a a 

s.t. sinr?L > a Vfc 



I* lis < P 

I Y m || 2 — 1 mi 



(21) 



<& ra are fixed for n ^ m. 

Similar to the approach of Section 14.2.11 it can be readily verified that S\ m (P) can be solved 
through the following power optimization problem, 



Pi, m (P,a) = { 



min * m ,fe b 

s.t. sinr*L > a Vfc, 



l*mll: 



< b 



(22) 



<& n are fixed for n ^ m. 
which is connected to the original problem Si m (P) as follows. 

Corollary 1 For any given power budget P, T , i )7n (P,a) is strictly increasing and continuous in a 
at any strictly feasible a and is related to Si >m (P) via 



Vi,m(P,Si, m (P)) = l for m = l,...,M. 



(23) 



To compute sinr^ in (fl~3|) it is seen that B m needs to know W n = <& n 3>^, n ^ m. Using {VF n }, 
B m can solve S\ tTn (P) optimally and obtains Q* m through solving Vi f m(P,a) in conjunction with 
a linear bi-section search (Algorithm 2, line 8). Note that solving S± m (P) optimizes the minimum 
worst-case rate locally in the m th cell and does not necessarily leads to a boost in the network 
utility function. As a result, B m is allowed to update its precoder to <&* m only if such update results 
in a network- wide improvement (lines 9-13). 

The successive updates of the precoders continue until no precoder can be further updated 
unilaterally. The convergence to such point is guaranteed by noting that the algorithm imposes 
the constraint that B m can update its precoder only if it results in network-wide improvement. 
We also note that another variation of Algorithm 2 is also possible. In this variation, instead of 
fixing {1, • • • ,M} as the order of processing (i.e., the order in which the BSs attempt to update 
their precoders) as done in Algorithm 2, we can employ a greedy approach. In particular, at each 
iteration each BS can compute its precoder (assuming precoders of other BSs to be fixed). Then in 
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a bidding phase, each BS can broadcast its choice and only the choice which maximizes the network 
minimum worst-case rate is accepted by all BSs. 

We next propose another distributed algorithm that can optimally solve the optimization prob- 
lem in (|14|) by introducing more auxiliary variables and using dual decomposition. This algorithm 
involves a higher level of inter BS signaling than the previous distributed procedure. Such an ap- 
proach of introducing more auxiliary variables and using dual decomposition has been employed for 
the multi-cell uplink with perfect CSI in [21] and more recently in [22] over the multi-cell downlink 
also with perfect CSI. First, we re- write (I14p as 



s.t.- 



(I' l m,m' u 'm|- e m,mll' u) mll2) + 



|*m||| < P m Vm 



maX A^ n h t,n®n(*n) H (ht„ n ) H < (/? m ,n) 2 Vfc, 171 ^ U. 

To solve (|24p we propose a bi-section search over a in which for any fixed a, we solve the following 
problem 



mm 



Em ll^mlll 

S ^ ^, m * m , fc (* m , fc )««, m )«+E„ #m (^,J 2 +l " a ' ' 

I — X m. . m. 1 ' 



(25) 



| *m III <Pm Vm 



max Afc h k m $ n ($ n ) H (h k ) H < (/3^ n ) 2 Vm ^ n, k. 

Next, for each BS m, we define variables /3m™,^n,m which denote its copies of /3 mjn ,/3^, m , respec- 
tively. Also, let be the vector formed by collecting all such variables. Then, we can re-write 
(1251) as 



m[n {$m,f3 im) } Em II*™ II 2 
S.t 



I ~ k k k k 1 2 

( I , m w m I — e m , m 1 1 w m 1 1 2 ) 



max fc ^, m *m,fc(*m,fe) H (^, m ) H +En^ 



2 +i 



> a Vfc, m 



max AJ ^, m * ro (* ro ) H (^, m ) fl < («) 2 Vj,n/ 

" 77, . T77 1 



m 



(26) 



II* ||2 < p W™ 
II II 2 — Jr m v 

Pm,n = Pm,n VK, m ^ 71. 



Similar to the proof of Theorem [U it can be shown that (|26p is equivalent to a (convex) SDP. There- 
fore, (|26p being convex provides that the optimal primal value of (|26p and that of its Lagrangian 
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dual are equal. In other words, the duality gap is zero and strong duality holds for (|26p provided 
Slater's condition is also satisfied. Define dual variables {A^ n } and let A denote the vector formed 
by collecting all such variables. Consider the following partial Lagrangian 



L({* m ,f3^}, A) ± £ + £ £ A*, n 091 



k,m 
m,n 



ok,n \ 
t J m,nl 



(27) 



and the dual function 



5(A) 



mm 



{# m ,/3( m )} 



s.t. 



I* II 2 . 



Iflfc* w k l-e fe II' 

VI m.m ml m.ml 



k,m 
m,n 



ok,n x 
h>m,n ) 



max Afc /i m , m * m , fe (* m , fc ) w (^ m , m ) J;f +En #m («™) 2 +l 

max fcJ * m ($ m ) H (/i4 ) H < (/3£,m) 2 Vj,n / m 
Vm 



> a Vfc, m 



(28) 



I* II 2 < P 

|^m||2 — r n 



Notice that the dual problem splits into M smaller problems of the form 



mm 



*m,{/3( m >} 



S.t.- 



I ^m-m"" 



I tit? Ho 



max fc m V m fc <W m 

\\<t> II 2 . < P 

|| II 2 — wi ■ 



> a Vfc 



(29) 



Using the arguments provided in the preceding sections, each of the smaller problem in (j29|) can be 
shown to be equivalent to an SDP. Invoking the strong duality, we can recover the primal optimal 
solution by solving the dual problem max.\{g(A)}. The latter problem can be also solved in a 
distributed manner via the sub-gradient method. In particular, suppose {/3^ "*} are the optimized 
variables obtained upon solving the decoupled optimization problems in (|29p . Then the dual vari- 



ables can be updated using a sub-gradient as A^ n — > A^ n + /J,0m™ — ^m%), V k,m ^ n, where [i 



is a positive step size parameter. Note that updating A^ n involves exchanging (3m™,/3m™n 



"m,n 



between 



BSs m, n, respectively. Finally, we note that a speed-up can be obtained at the cost of some sub- 
optimality by forcing equality after a few steps of the sub-gradient method. In particular, we can 

for all k, m ^ n and then concurrently optimize the 



set both f3^,/3^ n to be equal to 



M decoupled problems in (I26p over {<& m }. The current choice of a is declared feasible if and only 
if all of the problems are feasible. 



17 



Algorithm 2 - Distributed Robust Max-Miu SINR Optimization 



1: for m = 1, . . . , M do 

2: Input P m 

3: -B m initializes <f> m = 

4: end for 

5: Using * m , {W„}„^ m , each B m computes sinr^, V I 

6: repeat 

7: for m = 1, . . . , M do 

8: B m solves 6>i im (P) and obtains <fr* n ; 

9: B m broadcasts = *^(** n ) H 

10: Each B„, n ^ m, computes sinrjf, V Z based on W* m , <&„ and {Wj}j^ m , n 

11: if minjjsinr^*} < min/jsinr^} then B n sends an error message to B m 

12: if B m receives no error message then it sets <& m «— 3>*„ and broadcasts an update message 

13: Upon receiving the update message each B„, n ^ m, sets W m <— W* n and updates sinrj t , V I 

14: end for 

15: until no further precoder update is possible 

16: Output {*,„} 



and broadcasts W r 



H 



5 Robust Weighted Sum-rate Optimization 
5.1 Full Cooperation 

By recalling the definitions in ([5]) and taking into account that the uncertainty regions corresponding 
to SINR^ and SINR^ for m ^ n or I ^ k are disjoint, finding the worst-case SINR for each user can 
be carried out independently of the rest. Hence, by recalling that the worst-case SINR of user 
is denoted by sinr^, the problem 7Z(P) is given by 



K(P) 



max {$m} £m=l Ef=i < log(l + sinr* ) 



S.t. ||*m||| < Vm. 



The problem TZ(P) as posed above, is not a convex problem. Optimal precoder design based on 
maximizing the weighted sum-rate even when the BSs have perfect CSI is known to be an NP hard 
problem. Thus, even in this case only efficient techniques yielding locally optimal solutions |13j 
can be obtained. Clearly, the robust weighted-sum rate problem is also NP hard and hence good 
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sub-optimal algorithms are of particular interest. Here we leverage an approach developed in 
(which considers the single-cell scenario and designs a provably convergent algorithm yielding locally 
optimal solutions) and propose a suboptimal solution by obtaining a conservative approximation 
of the problem 1Z(P). This approximation provides a lower bound on 1Z(P). 
To start we define the set of functions {S^u) : M. — > M} as 

Sm( u ) = a m u ° m h exp(u - 1), for m = 1, . . . , M, k = 1, . . . , K, and 

1 + sinC 

It can be readily verified that we have 

maxS'^('u) = log ( 1 + sinr^J and u* = argmaxS^(u) = log ( 1 + sinr^ ) +1 Vfc,m. (31) 

Therefore, by incorporating the slack variables u = [u^] and substituting the objective function 
of TZ{P) with its equivalent term X^m=i J2k=i max 'u* l the problem 1Z(P) is equivalently 

given by 

^(p) = / max {«M>" ^m=l EjLi «m " rffe ex P( n - " *) (32) 
[ S.t. Il*m||| ^ Vm. 

For any fixed u we define the intermediate problem TZ(P, u) which yields the optimal precoders 
{<& m } corresponding to the given u and power budget P. Since for a given u the term Yl,m Efc a m u m 
becomes a constant we get 

' minr* | T,™ T*L, exp(u* - 1) 

s.t. ||*m||i ^ Vm. 

The problem 1Z(P,u) can now be transformed into a weighted sum of the worst-case MSEs as 
follows, 

K{P u) = I min{ * m} ^™ =1 a ™ exp ^™ " X) max {A^,„} min /4 M $ E ™ (34) 
s.t. ||*m||| < p m Vm. 

Next, for any given it we find an upper bound on lZ(P,u), which by recalling (j32|) and ()34|) is 
a lower bound on 1Z(P). By invoking the inequality in (flUj) and defining / m = [f^]k, an upper 
bound on 1Z(P, u) can be found as follows, 

U(P U) = { mill{ * m - f ™> ^™ =1 a ^ eXP ^™ " ^ maX {A^,„} M S E m (35) 

s.t. ||*m||i — Pm Vm. 
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Algorithm 3 - Robust Weighted Sum-rate Optimization 



1: 



Input P and {h mn ,e% ln } 



2: 



Initialize 3? m , f m for all m and u 



3: 



repeat 

Solve TZ(P,u) by optimizing over {«fr m } and {/^} in an alternating manner; 
Update uf n <— 1 — log f max{ A k j MSE TO ^ , V fc, m 



4: 



5: 



6: until convergence 
7: Output {* m } 

TZ(P, u) can itself be sub-optimally solved by using the alternating optimization (AO) principle 
and optimizing {f m } and {<& m } in an alternating manner. By deploying AO we can optimize 
{<& m } while keeping {f m } fixed and vice versa. Since the objective is bounded and it decreases 
monotonically at each iteration, the AO procedure is guaranteed to converge. In the following 
theorem we show that solving TZ(P, u) at each step of the AO procedure is a convex problem with 
a computationally efficient solution. 

Theorem 6 For arbitrarily fixed {<l? m } ; the problem TZ(P,u) can be optimized over {f m } effi- 
ciently as an SDP. Similarly, for arbitrarily fixed {f m }, the problem TZ(P,u) can be optimized 
over {<& m } efficiently as another SDP. 

Proof: See Appendix El ■ 
Algorithm 3 summarizes the steps required for sub-optimally solving 1Z(P). This algorithm 
is constructed based on the connection between the objective functions of 7Z(P) and 7Z(P,u). 
At each iteration of Algorithm 3 for a fixed u, 1Z(P, u) is solved by using the AO principle as 
discussed above and a new set of precoders and equalizers is obtained. The minimum rate achieved 
by using this set of precoders and equalizers provides a lower bound on 7Z(P). This set of precoders 
and equalizers is also deployed for computing the worst-case MSEs and updating u as = 1 — 



it. Therefore, the utility function of Algorithm 3 is bounded and increases monotonically in each 
iteration. Thus, convergence of Algorithm 3 is guaranteed. 

2 Note that the worst-case MSEs can be computed using the techniques given in Appendix [Dl 




, V k, mlo Since TZ(P) is bounded from above, so is any lower bound on 
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5.2 Distributed Implementation 

An advantage of the AO based approach employed to sub-optimally solve TZ(P) in the previous 
section is that it is amenable to a distributed implementation. In particular, note that for fixed 
u, {/ m }, the optimization over {<& m } decouples into M smaller problems ()60|) that can be solved 
concurrently by the M BSs. Similarly, for fixed u, {3> m }, the optimization over {/ m } decouples 
into KM smaller problems (|62|) that can be solved concurrently. Finally, for a given {<& m , f m } 
the elements of u can also be updated concurrently. Consequently, Algorithm 3 can indeed be 
implemented in a distributed fashion with appropriate information exchange among the BSs. 



6 High snr Analysis: Degrees of Freedom 

In this section, we analyze the high SNR behavior of the robust max-min rate and the robust 
weighted sum rate. We suppose that for each user H/i^ml! > e mm an d that the power of the m th 
BS scales as "fP m , V m, where 7—7-00. Then, for any given choice of precoding matrices {<& m }, we 
examine the worst-case sinr given in (|12p . Notice that by choosing any particular error vectors from 
their respective uncertainty regions, we can obtain upper-bounds on the worst-case sinrs {sinr^}. 
In particular, to obtain an upper-bound on sinr^, for each interfering BS n / m, we select a user q 
and choose an error vector A^ n that maximizes n Wn\ 2 - Similarly, for BS m we select a user 
q 7^ k and choose an error vector A^ m that maximizes m Wm\ 2 . With these particular { A^ n } 
and some algebra, we obtain an upper bound given by 



\h k || + e fe \ 2 \\w k II 2 

I m^rri II 1 m,m/ II mil 



1 /n„..i 1 



(36) 



where = Wn/\\wn\\, Vg,n and 



f(h k m , n ,wl, {wi}^ q , e k m , n ) = {\h k m>n wl\ + 4,„) 2 ||<|| 2 + X)(l^™,>«l - ^Jwi H w q n \) 2 \\wi\\ 2 . (37) 
Then, we can optimize over the choice of q to obtain a tighter upper bound given by 



(A,m\\ + 4,m) 2 |kml| 2 



(38) 



1 + max&ki f( h m,m: "IWiW.d.J} + E„#m maX «{/(^m,n. W n, {™n}i^, <4,J} 

Note that the upper bound in (|38p is tight when we have perfect CSI, i.e., when all e^ n = 0. A 
simpler upper bound that suffices for a result proved in the sequel is given below. 



~k 

„k ub 

c k \2 II || 21 1 V" 1 /HI. I -L- c fc ~\2||„„9 ||21 



(ll^m,mll e m,m) ll^mll 

smr ™ ~~ "T^fc — tt~^ — ; — ~ n „„, — ~ tt^i — ~ — : — ~ — —■ y 6S) > 



1 + m&n q ^ k {{\h mm w q m \ + e^ jm ) 2 ||w^|| 2 } + En^m m ax g {(|/i TO ,„w«| + J 2 ||ii^|| 2 } 
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The following theorem characterizes the high SNR behavior of the robust max-min rate and the 
robust weighted sum rate. 

Theorem 7 Suppose that n > 0, V k, m, n and the vector of transmit powers is scaled as jP. 
Then, the minimum worst-case SINR 5( 7 P) as well as the worst-case weighted sum rate TZi^fP) are 
monotonically increasing in 7. Moreover, we have that 

lim sup ShP) < 00 & lim "^ 7 /? = max{a„}. (40) 

7 ^5o 7^oo log( 7 ) k,m 

Proof: First, we note that for any given vectors {w!^} and any choice of channel vectors, we have 
that 



y\h k w k I 2 



— k ( 41 ) 

E/^fc l\h-m,m wl m\ 2 + En^m En\ h m,n w n\ 2 + 1 

is monotonically increasing in 7 > 0. Consequently, we have that each of the worst-case sinr given 
in (|12p must increase when the beamforming vectors are scaled as {y/^yw^}. Thus, the mini- 
mum worst-case sinr S{^P) as well as the worst-case weighted sum rate IZ^P) are monotonically 
increasing in 7. Next, we note that sinr^ ub in ()39f) can further be upper bounded as 

■ k ub < (ll^m.mll + £ m,m) IK^mJI /,r,x 

m - i + E^«J 2 ll^ll 2 /(^-i) + E^ m E g «J 2 ll^ll 2 /^' 1 ' 



The RHS in (|42p corresponds to the SINR in a fully connected MK— user Gaussian interference 
channel (GIC) with constant channel gains and where each source and each receiver have a single 
antenna. For this GIC it is known that the symmetric rate saturates as the transmit power of each 
source is simultaneously increased and that the total degrees of freedom is one |24j . The results in 
PO"]) now follows. ■ 
The result in Theorem [7] must be contrasted to the interference-alignment results that have 
recently been developed (cf. [24]). We recall that the key idea behind interference alignment is to 
ensure that the all the interference seen by a user (receiver) aligns itself (i.e., confines itself to a 
sub-space) so that the user sees the remaining subspace as interference free. Such an alignment 
based solution possesses a first-order optimality at high SNR in the case of perfect CSIT. Theorem 
[7] shows that such an alignment is not possible over our model due to the uncertainty involved in 
the available CSIT. Thus unlike the perfect CSIT case no simple high-SNR alignment solutions are 
possible and consequently algorithms that consider robust optimization at finite SNRs remain the 
only viable options. 
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Next, we consider another good albeit sub-optimal approach for selecting the beamforming 
vectors. We consider the worst-case signal-to-leakage- interference-plus-noise ratio (SLINR). The 
SLINR metric has been shown to be an effective metric over networks with perfect CSI |13|I25|. In 
particular, the worst-case SLINR corresponding to user k in cell m is given by, 

\h k m k I 2 

shnr™ = mm ; - ; 43 

/A* \V I ft-? in k \ 2 4- V V \h l ni k I 2 -I- 1 
X^m,nS Auj^k\"'m,m UJ m\ ' Z^in^m L^il \"'n,m UJ m\ ^ J- 

Notice that the uncertainty regions in the numerator and denominator are decoupled so that 

mm j A fc i |^m,m w m| 2 

I" K I "m.m J ' / a a \ 

slinr^j = ; — — ! - t — (44) 

Hj^k maX {A^. m } \^ l m,m W m\ + En^m Yll maX {A^ m } l^n,™ 11 '™! + * 

Suppose that a per-user power profile {-P^j} nas been given. Then, the beamforming vectors can 
be independently designed by solving 



max slinr m 



k 

(45) 



s-t- ||u&||! < Pi V k,m. 



The maximization problem in (|45|) can be exactly solved by alternatively solving a power optimiza- 
tion problem in conjunction with a linear bi-section search as described in (|14p and (|15p . 



7 Simulation Results 

In this section, we provide simulation results to assess the performance of the proposed algorithms. 
Both robust max-min rate and robust weighted sum-rate problems as discussed in Sections U] and 
[5] can be posed or conservatively approximated by semidefinite programs. Therefore, we use the 
software package SDPT3 developed as a MATLAB toolbox for solving semidefinite programs [26J. 

We consider a network consisting of two cells (M = 2) and each cell with two users (K = 
2). Each BS is equipped with two transmit antennas (N = 2) and each user has one receive 
antenna. Also we consider identical power constraints for all BSs and set P m = 10 dB for m = 1, 2. 
Fig. Q] compares the optimized minimum worst-case rates (robust max-min rates) achieved by full 
cooperation (Algorithm 1) and limited cooperation (Algorithm 2), respectively, among the BSs. 
Note that the performance yielded by Algorithm 1 can also be achieved in a more distributed 
manner as described in Section 14.31 We consider 20 channel realizations and for each realization we 
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Figure 1: Comparing the robust max-min rates obtained with centralized and distributed precoder 
designs over 20 independent channel realizations and different uncertainty regions. 

solve the robust max-min rate problem under four different setups. As the baseline we consider the 
fully cooperative scenario with perfect CSI, i.e., e^ nn = for all k,m,n. For perfect CSI, the SINR 
lower bound given in (|13p becomes the exact SINR, i.e., sinr^ = sinrj^ = SINR^ and therefore the 
solution of Algorithm 1 is the optimal solution. We normalize the robust max-min rates obtained 
in different scenarios by this optimal robust max-min rate. Next, we obtain the robust max-min 
rates for different uncertainty regions with radii e\ nn = 0.05 and 0.1 for all k,m,n. It is observed 
that larger uncertainty regions result in smaller robust max-min rates, which is expected. Finally, 
we assess the robust max-min rate obtained by the distributed algorithm with limited cooperation 
(Algorithm 3), where each BS updates its precoder unilaterally and the radii of the uncertainty 
regions are e^ n = 0.1 for all k,m,n. For some channel realizations, the solution obtained by 
this distributed algorithm is precisely equal to that of the algorithm with full cooperation. For 
most realizations, however, Algorithm 2 exhibits degraded performance compared to Algorithm 
1 . This is the cost incurred for the benefit of having limited cooperation between the BSs. In 
Fig. [2] we consider the same setup as in Fig. Q] and compare the relative performance yielded by 
the two distributed algorithms which solve the robust max-min rate optimization problem through 
power optimization and MSE optimization. According to the simulation results, neither of these 
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Figure 2: Comparing the robust max-min rates obtained with the two proposed distributed algo- 
rithms (power optimization and MSE optimization) over 20 independent channel realizations and 
different uncertainty regions. 

two algorithms consistently outperforms the other one. Also, it is observed that for any channel 
realization, the performance of the better one is almost close to that of the situation when the 
perfect CSI is viable. 

In Fig. [3] we consider the same network setup and also for convenience set the rate weighting 
factors equal to 1, i.e., = 1 for all k,m. Similar to Fig. [IJ we examine the uncertainty regions 
with radii e^ n = 0, 0.05 and 0.1, respectively and plot the achievable robust weighted sum-rates 
which are determined using Algorithm 3. The relative performance of different settings are identical 
to those obtained in Fig. [TJ The key observation is that the performance degradation due to larger 
uncertainty regions is smaller for the robust weighted sum-rate problem than that seen in the 
robust max-min rate case since the former is less vulnerable to the undesired CSI noise and hence 
is expected to degrade more gracefully as the uncertainty regions expand. 

Fig-Slplots the robust max-min rate (achieved using Algorithm 1) versus SNR. Here, we consider 
three different network settings; one with three cells (M = 3) each with three users (K = 3), one 
with two cells (M = 2) each with two users (K = 2), and finally one with four cells (M = 4) each 
with (K = 10) users. In the first two settings we assume that the number of transmit antennas per 
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Figure 3: Comparing the robust weighted sum-rates obtained over 20 independent channel realiza- 
tions and different uncertainty regions. 




Figure 4: The robust max-min rate versus SNR (dB) for different levels of uncertainty. 
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Figure 5: Comparing the minimum worst-case rates yielded by the robust beamforming and zero- 
forcing beamforming designs for M = N = K = 3. 

BS are N = 3 and in the third one we assume N = 4 transmit antennas per BS. It is observed that 
for each fixed uncertainty region, there exists a considerable gap between the robust max-min rates 
of the settings M = K = 3 and M = K = 2 as well as between that of the settings M = 10, K = 10 
and M = K = 3. This is due to the fact that independent messages are transmitted to different 
users, and therefore increasing the number of users from 4 to 40 increases the amount of the 
interference imposed on each user, which in turn degrades the quality of the communication for all 
users. Moreover, as the number of users increases, the likelihood that the weakest user suffers from 
a very weak communication quality increases. Further, as predicted by Theorem[71 for each setting 
the robust max-min rate saturates at high SNR. We note that the saturation of the optimized 
min rate at high SNR even in case of perfect CSI can be deduced from [23]. In particular, we can 
infer from the results in [241 that assigning one degree of freedom to each user is not possible for 
any of these three settings^ Moreover, from Fig. [5] (which considers M = K = N = 3) we see 
that the optimized robust designs yield a substantial improvement in the minimum worst-case rate 
compared to the naive zero-forcing strategy, wherein each BS designs beam vectors for its in-cell 
users under the assumption that it alone operates in the network and that the channel estimate 



3 Note that assigning a fractional degree of freedom to any user is not possible with our model since we do not 
allow precoding (beamforming) across multiple time and/or frequency slots. 
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SNR (dB) 

Figure 6: The robust sum-rate versus SNR (dB) for different levels of uncertainty. 

vectors available to it are perfect. Each BS performs the zero-forcing operation on its channel 
estimate vectors and then does power allocation to maximize the minimum rate among its in-cell 
users. 

Fig. [6]depicts the optimized worst-case sum-rate (robust sum-rate) for the same network settings 
and uncertainty regions as in Fig. 2J Note that unlike the robust max-min rate, the robust sum- 
rate must increase with the network size. Also, for each setting the optimized worst-case sum rate 
saturates at high SNR while Theorem [7] predicts that the robust sum-rate has one degree of freedom, 
i.e., scales as log(SNR). This is due to the simple albeit sub-optimal AO technique employed wherein 
conservative bounds are optimized at each step. However, with perfect CSI we obtain a positive 
total degrees of freedom. For the model at hand with M = K = 3 (M = K = 2), using the 
results in [23] an upper bound on the total degrees of freedom can be computed to be 4.5 (3), after 
assuming perfect in-cell user cooperation and allowing for precoding across multiple time and/or 
frequency slots. This upper-bound needs not be achievable and the total degrees of freedom we 
observe from the plot is 3 (2). Similarly we observe that for the setting M = 4, K = 10 the number 
of degrees of freedom is 4. Next, in Fig. [7J we consider the setting M = K = 3 and compare 
the worst-case sum-rates yielded by the robust designs, the naive zero-forcing strategy and the 
SLINR beamforming. Note that in the latter two cases each BS computes its beamforming vectors 
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SNR (dB) 

Figure 7: Comparing the worst-case weighted sum-rates yielded by the robust beamforming, zero- 
forcing, and SLINR-based beamforming designs for M = N = K = 3. 

independently. In particular, the beam vectors were obtained in the zero-forcing case as described 
for the example in Fig. [5j except that the power allocation is done to maximize the sum-rate. On 
the other hand, the beam vectors were obtained in the SLINR case as described in Section [6] for 
each power profile and an exhaustive search was conducted over power profiles to maximize the 
sum SLINR. Note that gains obtained by the robust and SLINR-based designs over the zero-forcing 
design are significant and commensurate with the extent to which they account for the interference 
seen from other sources and that imposed on other users. 



8 Conclusions 

We have considered designing robust precoders for multi-cell multiuser downlink systems when the 
BSs can acquire only noisy channel estimates. To account for the uncertainty about the channel 
states we adopt the notion of worst-case robustness and aim at maximizing the network minimum 
rate and a weighted sum-rate of all users for the worst-case estimation perturbation. Depending on 
the level of cooperation among the BSs, algorithms with full cooperation and limited cooperation 
are offered for designing the precoders. The precoder design problems can be either posed as convex 
problems, or conservatively approximated by some convex problems. All convex problems are shown 
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to have computationally efficient solutions. Table [T] summarizes the main proposed algorithms. 



Problem 


Optimality 


Complexity 


Type 


Robust max-min rate (K = 1) 


optimal 


SDP 


centralized 


Robust max-min rate via power optimization (K > 1) 


suboptimal 


SDP 


centralized 


Robust max-min rate via MSE optimization (K > 1) 


suboptimal 


GEVP 


centralized 


Robust max-min rate via power optimization (K > 1) 


suboptimal 


SDP 


distributed 


Robust max-min rate via power optimization (K > 1) 


suboptimal 


SDP 


distributed 


Robust weighted sum-rate (K > 1) 


suboptimal 


SDP 


distributed 



Table 1: Summary of the algorithms. 



A Proof of Theorem [T] 

Let us denote the set of precoders obtained from solving S(P) by {w^} and their corresponding 
worst-case SINRs by sinr^. Prom the definition of S(P) we have 

\\w* II 2 

" m " 2 < 1 Vm, and min{sinr* } = S(P) => sinr* > S(P) Mm. 

Pm m 

From the definition of V(P, a) we find that for the choice of {w^}, the choice of b = 1 is achievable 
for V(P,S(P)) and therefore V(P,S(P)) < 1. 

Next we show that V(P,S(P)) cannot be less than one. Let us denote the set of precoders 
obtained by solving V{P ,S{P)) by {w™}. From the definition of V(P,S(P)) we clearly have 

II ID** II 2 

sinr^ > S(P). If V(P,S(P)) < 1 i.e., if max m p 2 = c < 1, then we define the set of pre- 
coders {w m } = {-iu^/i/c}. {w m } clearly satisfy the power constraints and moreover for their 
corresponding worst-case SINRs from (jlOj) we have 



1 

c 


(\hm,m'W m \ ^'m,rn\'^m 1 1 2 ) 


2 


(\h 


m,rnWjxi \ ^m,m)\'^ jji H2) 


2 


1 T 

c t—m 




\h-m,n1Mn* \ ^m,n \\^n* II 2 


2 > 

+ 1 En 




I h m n W^ | -|- € m ^n \\^n* II 2 | 


2 + l 



since c < 1. Therefore, we have found a set of precoders {w m } which satisfy the power constraints 
and yet yield a strictly larger robust max-min SINR compared to what the precoders {w^} obtain. 
This contradicts the optimality of {w^} and therefore T , (P,S(P)) = 1. The strict monotonicity 
and continuity of V(P, a) in a, at any strictly feasible a, follows from a similar line of argument. 
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B Proof of Theorem [2] 

By considering the characterization of sinr m given in (jlOp . the constraint sinr m > a provides that 
Vm, 3t m £ M + such that 

ID+I > at 2 m , (46) 
IWl+l < t 2 m . (47) 

Next, note that the optimal solutions are insensitive to any phase shift. In other words, if is 
an optimal solution, then w^e^ is also an optimal solution as such phase shifts do not alter the 
objective or the constraints of V(P, a) given in (jlip . Among such optimal solutions we select those 
for which h m>m w m ^ m has a non-negative real part and a zero imaginary part. Therefore, (|46p 
can be restated as 

£m,m 1 1 l^m II — h m m W m \f~Q&rm 



which is a second-order cone (SOC) constraint. In this context, we note that the useful step in (|48|) 
was first developed in [20], wherein an inequality of the form \hw\ — e\\w\\ > 1 was expressed as a 
convex constraint. 

Furthermore, by introducing the additional slack variables {c m>n }, {d m , n }i and {e mjn }, corre- 
sponding to the terms \h m , n w n \, e m)Tl ||iu n ||, and \h m>n w n \ +€ m ,n\\ w n\\, respectively, we can express 
the constraint in (|47p equivalently by 



|h m ,n™n| < C m>n Wl / 77J 

which are all SOC or linear constraints. By defining t = [ti, . . . ,£m] 5 c = [ c i,i • • • ,cm,m], d = 
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[di i . . . , cIm,m], and e = [en . . . , &m,m\ we can reformat "P(-P, a) as follows. 



m ^ n {w m },c,d,e,t,b b 



s.t. 



«?mll < ^ 



c m,n 



+ 1 < t m Vm 



\h , m,n'Wri\ ^ C"m,n 
tm,n\\Wn\\ < d m . n Vn / m 

< 6 Vm 



(50) 



||Wri 



Note that all the constraints above are linear or second-order cones and the objective is linear in 
b. Therefore, V(P ', a) is an SOC program and hence can also be expressed as an SDP. 

C Proof of Theorem |4] 

Similar to the proof of Theorem EJ the constraint sinr^ > a provides that Vm, A;, 3 E M + such 
that 



a(t 



k \2 



< 



Wm 2 



(51) 



and {t k m f > max h k m ^ m , k {* m , k ) H {h k m ^ m ) H + ^ max h k m ^ n {3> n ) H (h k m J H + 1.(52) 



A fc 



A fc 



Next, note that the optimal solutions are insensitive to any phase shift. In other words, if &* m is 
an optimal solution, then <&* n x diag(e je ™, . . . , e j£ ™) is also an optimal solution as such phase shifts 

do not alter the objective and the constraints of Vi(P, a). Among such optimal solutions we select 

~ k 

those for which h m m w^ n has a non-negative real part and a zero imaginary part. Therefore (|5ip 
for t k > can be stated as 



ck \\w k , 

~m,m II "'m I 



€t>1 r>1 I I TYi I | 2 f^nm <>->i li? 



which for a given a is a second-order cone (SOC) constraint. Next, by introducing the additional 
slack variables {e^ n }, the other constraints in ([52]) can be written as 

(^m) — ( e m,m) + ^ ] ( e m,n) ^ 1) 
n^m 



and (d m ) 2 > m * mjfc (* mife ) H (hi m ) H , V A fc 



• IIA fc II < lie* II 
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and (e mn ) > h mn 'f> n ('f> n ) (h mn ) , V A mn : || A mn || < ||e 



k 

m,n II • 



(53) 



We now demonstrate that the constraints in (|53p can be transformed into finitely many linear 
matrix inequalities. By applying the Schur Complement lemma [27], the constraints 

h k * (& ) H (h k ) H < (e k ) 2 VllA fc II < e k 



can be equivalently stated as 



,~k 



■"m,n 



Next, the following lemma proved in [18] (also used in [16]) is instrumental for transforming the 
constraints above into finitely many linear matrix inequalities which account for the uncertainty 
regions by deploying the additional slack variables A = [X-m n]k,m,n- 

Lemma 2 For any given matrices A, B, and C with A = A H , the inequality 



k )H 



(h m ,n + &m,n)®r 
e k I 



^ V IIA fc II < e' 



AhB H DC + C"D"B VD : \\D\\ < e, 



holds if and only if 



3A > such that 



A - \B H B -e C H 
-eC XI 



y 0. 



By setting B = -[1 0], C = [0 * n ], D = A^ n , and 



"'m,n^n 



and applying Lemma we find that the constraints n <& n (<& n ) H (h 1 ^ n ) H < (e^ n ) 2 for all 



3 fc \2 



l A m,nll < e m,n are equivalently given by 



-,k a 



e K - \ K 

, 7 k 



{^n) H {h m n ) H e m,nl e m,n(^'«) 



H 







>: V/c, m ^ n, 



(54) 
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which is a linear matrix inequality (LMI). Similarly we can show that the constraints m ^ m fc||| < 
( e m,m) 2 for a11 II A m,m II < e m,m are equivalently given by 



U 



m.m 



,k 

■"m.m 



h"m,rri* m,k 



(^m.k) (^m.m) 





e k I 





»(*m,*) H 



^m,m^m,f 



\K J 



>- Vm, fc. 



By defining t = [t^\k,m an d e = [ e mn]> 'Pi(Pt a ) can be cast as follows. 



min {#m}iAit>ei6 b 



s.t. 



r A' 



w m \\ 2 < h w m - ^at m Vm, k 



^2n^m( e m,n) 2 + ( e m,m) 2 + 1 — *m Vm, & 



m,n 
rk 



y_0 V fc,m n, 



||*m||2 < ^v 7 ^ Vm. 

which has linear objective and semidefinite or second-order cones and therefore is an SDP. 
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D Proof of Theorem 



By recalling ([3]) and further defining the slack variables {b^^}, the constraints {MSE^ < a 2 } can 
be equivalently presented as follows. 



En(^,n) 2 + 1 < fta 

fm e k\\ < b: 



\h k <t> 



k 

m.m 



Vm, k 

Vm,fe, V||A fc 



„ < e k 



(56) 



\h k * II < h k 



Vk,m^n V || A* || < e* 



where we let denote a length K unit vector having a one in its k th position and zeros elsewhere. 
Without loss of generality we have assumed € K + as multiplying the vectors with any unit- 
magnitude complex scalar will not change the objective or the constraints of the problem 52 (-P). 



Next, by applying the Schur Complement lemma the constraints \\h mm & m 



f m e k \\ < b k for all 



|A ft II < e k 



can be equivalently stated as 



(^m,m + ^m,m)^rn fm e k 



fk p 
Jm^k 



b k I 



y V A K < e K 
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which upon using Lemma [2] with B = -[1 0], C = [0 * m ], -D = A^ m , and 



V*m) V-m^m) Jm^k 



~ k k 
b k I 



are equivalently given by 



-.fc - 



Lfe _ \k 

) H (h k ) H - f k e H 

(^m) \""m,rn) Jm c k 





h"m,m^rn frn e k 



b k I 



\k j 



t Vm, k. (57) 



Similarly we can show that the constraints ||fa^ n <l? n || < b k nn holding for all ||A^ n || < e k m n are 
equivalently given by 



U 



k 







(^n) (^-m,n) t> mn I € m ^ n (<S> n ) 

Q _ c k rf« \k 



H 



— f k (h \k T 



y Vto, 7^ n, k. 



(58) 



Finally, note that the constraint y^J^J 2 + 1 < fmO, is equivalent to yj, + f^al H,Vm, A;, 
where 



rk ' 



(b k J H 
1 

Consequently, the problem S2(P) is equivalent to 



b k m 1 



, V TO, /c. 



(59) 



mm {# m ,/4},6,A,a ° 
S.t. 



T k m h0, Wk,m, 



\\^m\\l<Pm VTO, 



which is a standard form of GEVP [5]. 
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E Proof of Theorem [6] 

We first show that for any given and fixed {/^.}, the problem TZ(P, u) is equivalent to an SDP. We 
define = a^exp(u^ — l)/|/m| 2 an d see that 

M K M K / \ 

^ ^ «j eXp(M m - l)MSE m = 9m|frm,m«>l - /m| 2 +5ZgL|^n,m^m| 2 + ^Z^il^.m^ml 2 J 

m — 1 fc — 1 m— 1 fc = l \ i^fe n^m Z / 



=s(™m) 

Clearly, the optimization of TZ(P, u) now decouples into M optimization problems of the form 

s.t. max {A ! n m} d( w ™) < & ™ ( 60 ) 



- m ||2 



Using the techniques employed in the proofs of Theorems 4 and 5, we can verify that the constraints 
can be equivalently expressed as finitely many LMIs so that the optimization problem is equivalent 
to an SDP. Next, suppose {^m} are arbitrarily fixed. Then TZ(P,u) reduces to 

M K 

^EE ™^™- 1 ) "P* M ^ E m ( 61 ) 
m=l fc=l 1/*m,nJ 

The above optimization problem decouples into KM smaller problems of the form 

min max MSE„ (62) 

fk Ui 1 

Substituting g k m = in {62]), we can optimize instead over and the latter optimization 

problem can be readily shown to be equivalent to an SDP by using the techniques provided in |16| . 
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